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I.  INTRODUCTION 

The  proposed  BASIC  program  -for  smoothing  NUWES  data  uses  the  Least- 
Squares  Method  to  fit  a  polynomial  to  a  7-point  data  segment.   It  determines 
the  best  -fitting  polynomial  of  order   1,  2,  or  3  and  establishes  an  estimate 
for  the  coordinate  at  the  time  in  the  center  of  the  data  segment.    It  also 
establishes  a  Fi  gure-of-Mer  i  t  (FM)  -for  that  estimate. 

This  program  is  designed  specifically  -for  treatment  of  outliers  and 
nissing  points  but  it  can  also  be  used  -for  smoothing  of  other  observations. 
It  is  intended  to  be  used  as  the  basic  smoothing  program  for  NUWES  data  and 
to  provide  a  value  of  FM  for  each  observation  time  for  feedback  to  the 
Instrumentation  Department.   Values  of  the  FM ' s  also  provide  information 
for  other  users  of  the  smoothed  data  on  it's  quality  (i.e. ,  how  well  the 
smoothed  values  can  be  expected  to  represent  the  actual  path  of  a  vehicle.) 

This  report  is  intended  to  supplement  the  Project  Report  entitled 
"A  Figure  of  Merit  for  NUWES  Data".   (Ref.  7) 

II.  DATA  PREPARATION 

It  is  assumed  that  the  input  data  has  been  listed  in  the  following 
format 

Time,  Array,  x  comp  ,  y  camp  ,  z  comp. 

Each  component  (x,y,z)  is  smoothed  separately  and  the  Array  identification 
is  not  used  in  the  smoothing  process  so  a  separate  file  can  be  constructed 
for  each  component  with  the  Array  column  replaced  by  a  column  for 
identification  of  Quest i onabl e  Points  (QP),  i.e.,  for  identification  of 
hissing  Points  and  Outliers.   The  identification  numbers  are    specified  with 
Vray  A (6,5)  which  is  presented  in  the  next  section  of  this  report.   The 
proposed  input  format  is 

Time,  QP ,  XO 

rthere  XO  is  the  observed  value  of  x  at  the  corresponding  observation  time, 
fwo  preliminary  steps  must  be  taken  before  the  smoothing  can  be  performed. 
Fhese  arei 

A.  Treatment  of  Missing  Points.   Since  the  program  is  designed  to  treat 
7-point  data  segments  containing  no  missing  points,  the  data   must   be 
screened  and  temporary  values  provided  at  each  missing  point.  These 
■.emporary  values  are  established  by  taking  a  linear  average  of  adjacent 
)bser  vat  i  on  a  1  va  1  Lies  .   Th  e  f  o  1  1  owi  n  g  f  or  mu  1  ae  are    used  t  o  sup  p  1  y  t  h  e 
:emporary  values: 

1.   For   a   single  missing  value  at  time  t.  when  there  are 
observed  values  xo(t-l)  and  xo(t+l)  the  temporary  value  is 

xo(t)  ~     (xo(t-l)  +  xo(t+l))/2. 


2.  For  adjacent  missing  values  xo(t)  and  xo(t  +  l)  when  there  are 
observed  values  xo(t-l)  and  xo(t+2)  the  temporary  values  are 

xo(t)  ■  (2*xo(t-l)  +  xo(t+2) ) /3, 
;:o(t  +  l)  =  (xo(t-l)  +  2*xo < t+2) ) /3. 

3.  For  three  adjacent  missing  values  xo(t-l),  xo(t),  and  xo(t+l) 
when  there  are  observed  values  xo(t-2)  and  xo(t+2)  the  temporary  values 
are 

xo(t-l)  ■  (3*xo(t-2)  +  xo(t+2>>/4, 

xo(t)  =  <xo(t-2>  +  xo(t+2)>/2, 
xo(t+l)  =  (xo(t-2)  +  3*xo(t+2))/4. 

4.  For   -four   adjacent   missing   values   xo(t-t)  ,   xo(t)  , 
xo(t+l),  and  ::o(t+2)  when  there  are    observations  xo(t-2)  and  xo(t+3) 
the  temporary  values  are 

xo(t-l)  -  (4*xo(t~2)  +  xo(t+3))/5, 

xo(t)  =  (3*xo(t-2)  +■  2*xo(t+3)  )  /5, 

xo(t+l>  =  <2*xo(t-2>  +  3*xo(t+3))/5, 

xo(t+2)  -  (xo(t-2)  +  4*xo(t+3) ) /5. 

5.  When  there  are    more  than  four  adjacent  missing  values  in  a  7- 
point  data  segment  the  Least-Squares  Method  cannot  be  used  to  establish! 
an  estimate  for  the  midpoint  of  the  segment.    (It  could  still  be 
possible,  in  some  cases,  to  smooth  at  other  points  than  the  midpoint,  of 
the  segment.   'This  has  not  been  considered  as  it  would  require 
considerable  extension  of  the  smoothing  program  and  its  desirability 
would  be  questionable.) 

B.   Identification  of  Outliers.   The  procedure  for  identification  o-t 
Outliers  involves  the  use  of  Sequential  Differences  (Ref.2  and  Ref.5).  It 
will  not  be  discussed  in  detail  here.    A  Fourth  Order  difference  of 
magnitude  50  or  greater  is  considered  an  indicator  that  the  corresponding 
value  of  xo  is  a  potential  outlier.    When  there  are  adjacent  potential 
outliers,  the  one  with  the  greatest  magnitude  of  Fourth  Order  difference  is 
identified  as  the  actual  outlier  and  assigned  the  appropriate  number  in  the 
second  column  of  Array   A(6,5).   After  smoothing  this  outlier,  it  might  be 
prudent,  to  recalculate  the  Sequential  Differences  with  the  outlier  replaced  I 
by  its  smoothed  value  xe(t)  ho  give  assurance  that  the  adjacent  potential 
outliers  ar&    no  longer  identified  as  such.   Cases  where  adjacent,  potential 
outliers  will  also  be  identified  as  outliers  are,  hopefully,  rare    and  should) 
b  e  f 1 ag  g ed  for  special  treat  men  t . 


III.   THE  SMOOTHING  PROGRAM 


A.  ARRAYS 


The  program  presented  in  this  section  was  written  in  the  BASIC  Language 
•for  use  on  an  IBM  PC.  It  uses  6  Arrays  which  will  be  discussed  in  the  order 
that  they  appear  in  the  program. 


The  -first  Array  to  be  used  is  A  (6, 5).    It  i 

ARRAY  A (6,5) 


shown  below 


i\j 

0 

1 
2 

4 
5 


0 

T 


-1 

0 
+  1 
+2 


1 
QP 
0) 
-1) 
-2) 
-10) 
-11) 
-12) 


X0 
>:o<-3) 
x o ( -2 ) 
x  o  ( --  1  ) 
x  o  (  0 ) 
x  o  (  h-  1  ) 
xo(+2) 
xa<+3) 


XM 
xm( -3) 
xm(-2) 
x  m  (  - 1  ) 
x  m  (  0 ) 

'  x  m  <  + 1 ) 
xm(+2> 
xm  (+3) 


4 
XE 
xe(-3) 
>ce(-2> 
x  e  ( - 1 ) 
x  e  (  0 ) 
x  e  (  +  1  ) 
xe (+2) 
xe (+3) 


RES 
r  (-3) 
r  (-2) 
r  (-1  ) 
r  <  0 ) 
r  (  + 1  ) 
r  (-1-2) 
r  (+3) 


QUESTIONABLE  POINTS  (QP) 

0  xo  is  legitimate  observed  value 

-1  xo  is  unscheduled  missing  point 

-2  xo  is  scheduled  missing  point 

—10  xo  is  identified  as  an  outlier 

-11  xo  is  an  unscheduled  missing  point  and  an  outlier 

-12  xo  is  a  scheduled  missing  point  and  an  outlier 

Smoothing  of  a  7— paint  data  segment  is  performed  to  establish  the 
estimate  and  FM  for  the  midpoint  of  the  segment.   The  times  in  the  -first 
bolumn  o-f  Array  A  is  obtained  by  subtracting  the  time  o-f  this  midpoint  -from 
the  observation  times  of  the  di-ita  segment  so  that  column  1  of  this  array 
Jrti 1 1  always  be  the  same  for  all  data  segments.   The  second  column  of  Array 
I  p  r  o  v  i  d  e  s  t  h  e  i  d  e  n  t  i  f  i  c  a  t  i  o  n  of  t  h  e  03  P  '  s .   A  s  e  p  a  r  a  t  e  i  d  e  n  tificati  o  n  for 
scheduled  missing  points  may  be  omitted  but  is  included   for  potential 
future  use.   The  XO's  (column  3)  ^r<a    the  input  observed   values   including 
Iphe  temporary  values  for  the  missing  paints.   The  other  columns  of  this 
array  will  be  discussed  as  they  appear  in  the  smoothing  program. 

Array  T(5)  provides  the  table  values  of  the  Student— T  distribution 
"hat  a.rs    needed  to  establish  the  Fi  gur  e-of -Mer  i  t  (FM)  which  accompanies  the 
estimate  xe  at  the  midpoint  of  the  data  segment.   As  discussed  in  Reference 
I7,  the  number  'infinity'  cannot  be  used  by  a  computer  and  hence  it  is 
[arbitrarily  replaced  by  the  number  99.99  for  T(0).   This  value  is  large 
?nough  to  produce  a  large  value  of  FM  and  hence  to  indicate  low  quality  of 
:he  estimate  it  accompanies.   The  argument  j  in  T(j)  ranges  from  0  to  5  and 
.s  the  appropriate  number  of  ' degrees-of— freedom '  produced  in  Array  8(2,3). 
"his  will  be  discussed  later. 


Array  T(5)  is  shown  below. 


ARRAY  T<5) 


j       0         1         2        3        4        5 
T(j)    99.99    6.314    2.920    2.353    2.132    2.105 


ARRAY  B(2,3) 


\j 

0 

1 

<•) 

3 

K 

DFK 

SDRK 

FMK 

0 

1 

DF1 

SDR1 

FM1 

1 

2 

DF2 

SDR2 

FM2 

■-> 

DF3 

SDR3 

FM3 

In  this  array  K  is  the  order  of  the  polynomial  being   considered  (k~l  -for 
linear,  k=2  -for  quadratic,  and  k  =  3  for  cubic).   DFK  is  the  'degrees  o-f 
freedom'  available  for  fitting  a  polynomial  of  degree  k  and  is  established 
by  first  determining  the  number  NS  (element  C(3)  in  Array  C(8))  and  then 
reducing  this  number  by  the  number  of  parameters  (k  +  1)  in  the  polynomial. 
Thus 

IMS  =  N  -  (number  of  QP '  s  in  column  A(l,j)) 

=   C(3)  -  (number  of  negative  entries  in  column 
A(l  ,  j)  ) 

with,  initially, 

C(3)  -    7    -    number  of  points  in  the  data  segment. 

Columns  2  and  3  are 

B(i,2)  =  estimate  of  the  standard  deviation  of  the  noise 
in  the  observations  (the  xo's)  when  a  polynomial 
of  order  k  is  fitted  to  the  data  segment 

and 

B(i,3)  =  the  Fi  gure— of  --Mer  i  t  for  the?  estimate  ::e(0). 

ARRAY  3(5) 

j      0      1      2      3      4      5 
S  ( j  )    ::av    ss>:    si:-:    s2x    s3x    s3x  1 

The  element  S(0)=:-;av  is  the  average  of  the  xo's.   It  is  needed  to 
produce  the  xm's  (column  3  in  Array  A)„   The  other  elements  of  S  are  used  t 
calculate  the  values  for  the  SDRK. '  s  and  the  FMK  '  s  in  Array  B.  The  elements 
of  this  array  must  be  zeroed  at   the  start  of  each  smoothing  cycle  (line  29 
of  the  program)  since  they  contain  cumulations  of  product  of  the  data. 


ARRAY  D(3) 

j      0      1      2      3 
D(j)    bkO    bkl    bk2    bk3 

This  array  is  for  the  coefficients  of  the  polynomial  with  the  order  k. 
selected  to  fit  the  data  segment.  Its  elements  must  be  zeroed  before  each 
iteration. 

ARRAY  C(8) 

J0  123  456  73 

C(j)  T  QP  XO  NS  ITER  K  XE  RES  FM 

This  array  provides  the  final  output  of  the  smoothing  program.   In 
addition  to  the  time  (C(0))  for  which  the  smoothing  was  performed  and  the 
observed  or  temporary  value  of  ;:o  (C(2))  at  that  time,  it  identifies  the 
nature  of  the  QP  (C(l))  and  the  number  of  legitimate?  observations  in  tahe 
data  segment.     It  reports  the  number  of  iterations  of  the  smoothing  process 
(C(4))  and  the  order  of  the  polynomial  used  to  fit  the  data.   It  contains 
the  smoothed  value  XE  (C(6))  and  the  accompanying  value  of  FM(C(8)). 
Finally,  it  includes  the  residual  error  (C(7))  which  is  the  difference 
between  the  original  value  of  ::o  (C<2))  and  the  estimated  value  (C(6))  . 
Note  that  this  residual  error  can  differ  from  the  value  of  the  residual 
error  r (0)  in  cell  A(3,5)  of  Array  A  when  smoothing  is  iterated.   For 
example,  if  A(3,l)   (and  C(l))  has  the  value  -10,  then  the  residual  in  C(7) 
is  the  difference  between  the  outlier  value  of  >; o  in  C(2)  and  the  estimated 
value  x  e  in  C(6>  whereas  the  residual  error  in  A(3,5)  is  the?  difference 
between  the  estimated  value   xe(0)  in  cell  A(3,4)  after  the  last  iteration 
and  the  value  of  ::o(0)  in  cell  A  (3,2)  which  is  the  value  of  >:e(0)  in  cell 
A<3,4)  from  the  preceding  iteration. 


B.  THE  PROGRAM 

A  listing  o-f  the  BASIC  smoothing  program  -follows. 

BASIC  PROGRAM  LS7T3.DOC 

05  REM  LS7T3.DOC     .  17  Feb., 1985 

10  DIM  A  <  6  ,  5 )  ,  B(2,3),  C(8),  D(3),  D(5),  T(5) 

20  DATA  -3,  -2,  -1,  0,  1,  2,  3 

30  FOR  i=0  TO  6 

40  READ  A(i ,0) 

50  NEXT  i 

55  REM  STUDENT  T 

60  DATA  99.99,  6.314,  2.920,  2.353,  2.132,  2.105 

70  FOR  j=0  TO  5 

SO  READ  T(j) 

90  NEXT  j 

100  C(3)  =  7 

105  REM  ENTER  SMOOTHING  TIME 

110  C ( 0 )  =  21 60 

115  REM  ENTER  QUESTIONABLE  POINTS  (QP's) 

120  DATA  -10,  0,  0,  -2,  0,  0,  O 

130  FOR  i=0  TO  6 

140  READ  Ad  ,  1) 

150  NEXT  i 

160  C(l)  =  A (3, 1) 

165  REM  ENTER  OBSERVED  XO 

170  DATA  33238.6,   33267.7,   33313.5,   33374.2,  33434.8,  33510.5, 

33592 . 5 

180  FOR  i=0  TO  6 

190  READ  A(i ,2) 

200  NEXT  i 

210  C ( 2 )  «  A ( 3 , 2 ) 

215  REM  DETERMINE  NS 

220  FOR  i=0  TO  6 

230  IF  A (i,i)<0  THEN  C(3)=C<3)-1  ELSE  C(3)»C(3> 

240  NEXT  i 

245  REM  CHECK  IF  SMOOTHING  POSSIBLE 

250  IF  C<3><3  THEN  GO  TO  260  ELSE  GO  TO  290 

260  C(4>=0  :  C(5)=0   C(6)=C<2> 

270  C(7)=0  s  C(8)=T(0) 

280  GO  TO  1050 

290  C<4)=»0 

295  REM  START  SMOOTHING 

300  C<4)=C(4>+1  'ITERATION 

305  REM  ZERO  S  AND  D  CELLS 

310  FOR  j=0  TO  5 

320  S ( j ) =0 

330  NEXT  j 

340  FOR  j»0  TO  3 

350  D ( j ) =0 

360  NEXT  j 

363  REM  PRELIMINARY  CALCULATIONS 

366  REM  CALCULATE  XAV  AND  XM ' s 

370  FOR  i=0  TO  6 


S(0)=S(0)+A(i ,2) 

NEXT  i 

S(0)=S(0) /7 

FOR  i=0  TO  6 

A(i ,3)=A(i ,2)-S(0) 

NEXT  i 

REM  CALCULATE  SUMS  OF 

FOR  i«0  TO  6 

S(l)=S(l)+A(i ,3) *A(i ,0) 

S(2)=S(2)+A(i ,3) *A(i ,0) 

S  ( 3 ) =S ( 3 ) + A ( i  , 3 ) * ( A ( i  , 0 ) 

S ( 4 ) =S  <  4 ) +A  < l  , 3 ) *  <  A ( i  ,  0 ) 

NEXT  l 

S(5)=(28*S(4> 

REM  CALCULATE 

B<0, l)«C<3>-2 

SSR 1  =  ( 23*S  <  1  )  -S  <  2 )  --2 )  /  23 


PRODUCTS 


•2) 

•3) 


196*S<2> ) /28 
FM1 


B(0,2)=SQR(SSR1/B<0 


1  )  ) 

1) ) /SQR(C(3) ) 

FIT 


TO  690  ELSE  GO  TO 


:)*T(B<  1,1))  /SQR(C(3) 


XAV 


XM 


'SSX 
'SIX 
'S2X 
'S3X 

'S3X1 

'DPI 

'SDR1 
'FM1 

'DF2 
570 


'SDR2 
'FM2 

'DF3 
640 


B'(0,3)=B<0,2)  *T(B<0 

REM  CHECK  QUADRATIC 

B<  1  ,  1)=C (3) -3 

IF  B ( 1 , 1 ) < 1  THEN  GO 

REM  CALCULATE  FM2 

SSR2=  <84*SSR1-S  (3)  "-2)  /84 

B(l ,2)=SQR(SSR2/B( 1,1)) 

B(  1 ,3)=B( 1 , 

REM  CHECK  CUBIC  FIT 

B(2, l)=C<3)-4 

IF  B(2,1)<1  THEN  GO  TO  620  ELSE  GO  TO 

B(2,2)=0  :  B(2,3)=T(0) 

GO  TO  670 

REM  CALCULATE  FM3 

SSR3= ( 2 1 6*SSR2-S ( 5 ) A2 ) / 2 1 6 

B(2,2)=SQR(SSR3/B(2, 1) ) 

B(2,3)=B(2,2)*T(B<2, 1) ) /SQR(C(3) ) 

REM  SELECT  POLYNOMIAL 

IF  B(2,3) <B< 1 ,3)  AND  B(2,3) 

ELSE  GO  TO  680 

IF  B<  1,3X6(0,3)  THEN  GO  TO  770  ELSE  GO  TO  690 

REM  FIT  LINEAR 

C(5)=l  :  C(8)=B<0,3) 

REM  ESTABLISH  COEFFICIENTS 

D(1)=S(2) /23 

REM  DETERMINE  XE,  RES 

FOR  i=0  TO  6 

A(i ,4)=S(0)+D( 1 ) *A(i ,0) 

Ad  ,5)=A(i  ,2)-A(i  ,4) 

NEXT  i 

C(6)=A(3,4)  :  C ( 7) =C ( 2 ) -C ( 6) 

GO  TO  930 

REM  FIT  QUADRATIC 

CCS) -2  :  C(8)=B(1,3) 


'SDR3 
'  FM3 


B(0,3)  THEN  GO  TO  850 


:=1  FM=FM1 


b21 


res 


k=2  FM=FM2 


775  REM  ESTABLISH  COEFFICIENTS 

780  D(2)=S(3)/84  :  D(l)=  S(2)/28  :  D(0)=-4*D(2) 

785  REM  DETERMINE  XE,  RES 

790  FOR  i=0  TO  6 

800  A ( i , 4 ) =S ( 0 ) +D ( O ) +D ( 1 ) *A ( i , 0 ) +D ( 2 ) * ( A ( i , 0 )  -2 )      'xe 

810  A(i ,5) =A(i ,2>-A(i ,4)  'res 

820  NEXT  i 

830  C(6)=A(3,4)  :  C ( 7 ) =C ( 2) -C (6) 

840  GO  TO  930 

845  REM  FIT  CUBIC 

850  C(5)=3  :  C(8)=B(2,3)  '  k  =  3  FM=FM3 

355  REM  ESTABLISH  COEFFICIENTS 

860  D<3) =S(5) /216  s  D(2)=S(3)/84 

870  D( 1)=<S<2) -196*D<3) ) /28  :  D(0)=~4*D<2> 

875  REM  DETERMINE  XE  RES 

830  FOR  i-0  TO  6 

890   A  (  i  , 4  >  =S ( 0 )  +D  < 0  >  +D  (  .1  )  *A  (  i  ,  0  >  +D  (  2 )  *  <  A  ( i  ,  0  )  -2 >  +D  <  3  >  *  <  A ( i  , 0 )  A3  > 

900  A(i ,5)=A(i ,2)-A(i ,4)  'res 

910  NEXT  i 

920  C(6)=A<3,4)  :  C ( 7 ) =C (2) -C ( 6) 

925  REM  TERMINATE  IF  NOT  QP 

930  IF  A<3,1)=0  THEN  GO  TO  1050  ELSE  GO  TO  940 

935  REM  CHECK  RESIDUALS 

940  FOR  i=0  TO  6 

950  IF  A(l,i)<0  AND  ABS ( A ( i , 5) ) >1  THEN  GO  TO  980  ELSE  GO  TO  960 

960  NEXT  i 

970  GO  TO  1050 

980  IF  C(4)>9  THEN  GO  TO  1050  ELSE  GO  TO  990 

990  FOR  i=0  TO  6 

1000  IF  A(i,l)<0  THEN  A ( l , 2) =A ( i , 4 )  ELSE  A ( i , 2) =A ( i , 2) 

1010  NEXT  i 

1020  GO  TO  300 

1045  REM  TERMINATE 

1050  FOR  j=0  TO  8 

1060  LPRINT  C( j) ; 

1070  NEXT  j 

1080  LPRINT 

1090  STOP 


8 


C.  DISCUSSION  OF  THE  PROGRAM 

The  smoothing  program  can  now  be  discussed  in  some  detail. 

STEP  1.  Enter  the  segment  data. 

Be-fore  starting  the  smoothing  the  pertinent  information  on  the  data 
segment  must  be  entered.  This  includes  the  time  (program  line  110) ,  the 
QP's  (line  120),  and  the  observed  ::o's  (line  170). 

STEP  2.  Determine  NS. 

This  is  the  number  of  legitimate  observations  in  the  segment. 
(1 ines220-240) 

STEP  3 »  Ch ec  k  i  f  smoot h  i  n g  possible. 

1+   NS<3,  then  even  linear  polynomial  cannot  be  fitted  to  the  data 
segment  by  the  Least-Squares  Method.    (lines  250-280)   If  no  -fit  is 
passible,  then  terminate  (STEP  15). 

STEP  4.  Zero  iteration  counter  and  cells  in  Arrays  S  and  D. 

S   cells   cumulate   data   products   and   D   cells   contain  parameter 
coefficients,  (lines  290-360) 

STEP  5.  Calculate  xav  (S(0))  and  >;  m  '  s  (A(i,3). 

;:av   is   the   average   of  the  ;:o's  and  the   ;•:  m  '  s  are       the  differences 
::m  (  j  )  =:;a  (  j  ) -;;av.   This  reduces  computational  round-off  error, 
(lines  370-430) 

STEP  6.  Calculate  other  S ( j ) 's. 

These  are  sums  of  products  of  data  values,   (lines  440-500) 

STEP  7.   Determine  capability  of  linear  fit. 

Estab  1  i  sh  DF  1  ( B  ( 0  ,  1  )  )  ,  SDR  1  ( B  ( 0  ,  2 )  )  ,  and  FM 1  ( B  ( 0  ,  3 )  )  . 
(lines  510-540) 

STEP  8.  Check  quadratic  fit. 

If   NS  <  4   then   DF2  <  1  and  no  quadratic  fit  is  possible.   Otherwise 
Isstablish  SDR2  and  FM2.  (lines  550-590) 

STEP  9.  Check  cubic  fit. 

If  NS  <  5  then  DF3  <  1  and  no  cubic  fit  is  possible.   Otherwise 
establish  SDR3  and  FM3.  (lines  600-660) 

STEP  10.  Select  polynomial  for  fitting  data  segment. 
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If   FM3  <  FM2  and  FM3  <  FM1  then  select  cubic,  set  k  -  CCS)  =  3  and  fi 
cubic  in  STEP  13.   Otherwise,  if  FM2  <  FM1  then  select  quadratic,  set 
k  =  C(5)  =  2  and  fit.  quadratic  in  STEP  12.  Finally,  if  neither   of  the  abovi 
then  select  linear,  set  k  =  C<5)  =  1  and   fit  linear  in  STEP  11. 
(1 ines  670-680) . 

STEP  11.   Fit  linear  polynomial. 

Establish  linear  coefficients  (D(i)  's)  and  calculate  ::e(i)  's  and 
r(i)'s,  also  ::e(0)  in  cell  C(6)  and  res  in  cell  C(7).   (lines  690-750) 

STEP  12.   Fit  cubic  polynomial. 

Establish  quadratic  coefficients  and  calculate  estimates  and  residuals 
(lines  770-830) 

STEP  13.   Fit  cubic  polynomial. 

Establish  cubic  coefficients  and  calculate  estimates   and  residuals, 
(lines  850-920) 

STEP  14.   Check  QP's. 

If  xo(0)  is  legitimate  observation  (A(3,l)  ■*  0)  then  terminate  (STEP 
15).   Otherwise  check  number  of  iterations  (C(4)).   If  C(4)  =  10  then 
terminate.   Finally,  check  residuals  at  times  where  A(i,l)<0  (at  missing 
points  and  outliers).   If  any  of  these  <^re    greater  than  unity  in  magnitude 
then  repeat  the  smoothing  (STEP  4).   If  none  of  them  a.r^;    then  terminate, 
(lines  930-1020) 

STEP  15.   Terminate. 

Print  Array  C  and  STOP,   (lines  1050-1090) 
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IV.   APPLICATION 

The  example  used  to  illustrate  this  smoothing  program  is  the  sample 
selected  consists  of  40  paints  -from  times  t-2121  to  t=2160.   Additional 
points  at  both  ends  of  the  sample  are;    required  in  order  to  smooth  at  these 
times.   Appropriate  inputs  -For  this  sample  arts    given  in  the  -first  four 
columns  of  Table  1.    It  is  convenient  to  have  the  contents  of  Figure  1  on 
the  monitor  when  performing  the  data  smoothing.   Inputs  can  then  be  changed 
easily  by  making  the  appropriate  changes  in  lines  110,  120,  and  170.  This  is 
(particularly  convenient  when  smoothing  values  other  than  outliers  or  missing 
lvalues.   The  computer  printout  of  the  results  of  smoothing  this  sample  a.r<= 
shown  in  Figure  2  and  summarised  in  the  last  three  columns  of  Table  1. 

It   is   of  some  interest  to  compare  the  results  in  Table  1, 
established  using  the  BASIC  Program  LS7T3.BAS  on  an  IBM  PC,  with  the  results 
given  in  Table  1  of  Reference  7  which  were  established  using  a  Texas 
Instruments  calculator  (TI— 59) .   The  differences  can  be  attributed  to  the 
differences  in  the  number  of  places  used  in  the  computations. 

Operation   of  the  smoothing  program  can  be  examined  in   more  detail  by 
evising  the  program  to  version  LS7T4.BAS.   Application  of  this  revised 
urogram  to  the  treatment  of  the  missing  point  at  t:=2160  is  shown  in  Figure 
a.   This  figure  presents  the  elements  of  the  other  arrays  as  well  as  Array 

Several  aspects  of  the  smoothing  of  this  missing  point  a.r&    discussed 
Del ow. 

1.  The   initial  value  value  for  xo(0)  at  time  t=2160  which  is  entered 
in  A  (3,2)  is  the  temporary  value  xo  (0)  =  (xo  (--1  )  +xo  (  +  1  )  )  /2  =  33374  .  2. 
This  value  also  appears  in  C(2)  of  the  output.   The  value  shown  in 
A (3,2)  in  the  printout  of  Array  A  is  the  value  of  xe(0)  in  A (3,4) 
from  the  next  to  last  iteration. 

2.  The  initial  value  of  xo(-3)  entered  in  cell  A (0,2)  is  the  smoothed 
value  for  the  outlier  at  t~2157  which  was  established  when  that 
outlier  was  smoothed.   Again,  the  value  in  A (0,2)  in  the  printout 
of  Array  A  is  the  value  of  xe(-3)  in  A (0,4)  from  the  next  to  last 
i  terat i  on . 

3.  The  residual  error  r (0) =0.184  in  A (3,5)  is  the  difference  between 
the  values  of  xo(0)  in  A(3,2)  and  xe(0)  in  A(3,4)  whereas  the 
residual  error  in  C(7)  is  the  difference  between  the  temporarv 
value  of  xo(0)  initially  entered  in  A (3,2)  and  in  C(2)  and  the 
smoothed  value  xe(0)  in  A (3,4)  and  in  C(6). 

4.  In  each  iteration  of  the  smoothing  process  the  value  of  xo(0)  in 
A (3,2)  was  replaced  by  the  estimate  xe(0)  in  A (3, 4).   The 
difference  r(0)  shown  in  A (3,5)  is  r  (0  )  =x  a  (0) -x  e  (  0  )  which  is 
displayed  in  Array  A  as  A (3 , 5) =A (3 , 2) -A (3 , 4) =0. 2.   The  residual 
error  shown  in  C(7)  is  the  difference  between  the  on.gni.nal  value 
in  A (3,2)  which  is  saved  in  C(2)  and  the  final  estimate  xe(0)  in 
C(6)  and  in  A(3,4)  so  that  it  is  the  difference  between  the  initial 
temporary  value  and  the  final  smoothed  value, 

e.  , 
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C(7)=C(2)-C<6)=4.8. 

5.   In  the  same  way,  the  residual  error  in  A(-3,5)  is  the  difference 

A ( -3 , 5 ) =A  <  3 , 2  > -A ( 3 , 4 ) =0 .  1 

whereas  the  fitting  error  at  t=2157  is  the  difference  between  the 
initial  value  in  A(-3,2)  which  is  the  previously  smoothed  value 
(and  also  the  input  value)  and  the  smoothed  value  of  xe(-3)  in  A(- 
3,2)  so  that 

RES (-3) =33238. 6-33234. 5=4. 1. 

The  final  estimate  «e(~3)  in  A(-3,4)  is  consistent  with  the  7-point  data 
segment  centered  at  t=2160  whereas  the  initial  value  xo(-3)  is  consistent 
with  the  7-point  data  segment  centered  at  t=2157.   This  brings  up  the 
guest  ion  of  whether  previously  smoothed  values  should  be  iteratively 
smoothed  when  smoothing  other  points.   This  warrants  further  investigation 
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TREATMENT  OF  MISSING  POINT  AT  T  -  2160 


INPUT  OP'S 

-10   0   0   -2   0   0   0 

INPUT  XO'S 

33238.6   33267.7    33313.5    33374.2    33434.8    33510.5    33592.5 


ARRAY  C  (OUTPUT) 

2.160   -2   33374.2   5   3   3   33369.4   4.804688   1.941253 


ARRAY  A 

-3   -10 

33234.56 

-154.457 

33234.5 

.  0625 

-2    0 

33267.7 

-121. 3203 

33267 „ 94 

-.2421375 

-1    0 

33313.5 

-75.51953 

33313.25 

0   -2 

33369.58 

-19.44141 

33369. 4 

. 1835938 

1    0 

33434.8 

45.73125 

33435.35 

-.5507813 

2    0 

33510.5 

121.4805 

33510.  1 

. 4023438 

3    0 

33592.5 

203.4805 

33592.6 

-. 1015625 

ARRAY  B 

0    3 

0    2 
0    1 

26.00435 
1.34462 
.687484 

27.36421 

2.408822 
1 .941253 

ARRAY  S 

33389.02    102914.5    1680.715    412.1133   11728.02    -36.98433 

ARRAY  D 

1-19.62444  6  1.2241  4.906111  -.171224 
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TABLE  1 

Smoothing  Sample  NWS2AX1 

T  COLL         XO  QP       XE  RES  FM 

2117  3  33533.2  0  - 

2118  3  33567.7  0  - 

2119  3  33603.6  0        -  - 

2120  m  (33634.1)  -2  33634.9  (-0.8)  1.51 

2121  3  33664.4  0  33665.3  -0.7  1.53 

2122  3  33695.0  0  33693.5  1.5  1.41 

2123  3  33718.5  0  33720.9  -2.4  2.41 

2124  3  33745.8  0  33744.6  1.2  1.64 

2125  3  33767.1  0  33764.6  2.6  3.02 

2126  3  33780.7  0  33783.7  -3.0  3.13 

2127  3  33798.0  0  33799.7  -1.7  3.47 

2128  m  (33810.9)  -2  33816.3  (-5.4)  5.17 

2129  3  33823.7  0  33828.1  -4.4  4.73 

2130  3  33827.3  0  33821.6  5.7  8.35 

2131  3  33794.2  0  33786.6  7.6  11.23 

2132  3  33726.1  0  33724.1  2.1  2.46 

2133  3  33637.7  0  33640.1  -2.4  2.62 

2134  12  33556.5  0  33556.9  -0.4  5.07 

2135  12  33486.6  0  33490.0  -3.4  3.45 

2136  m  (33466.5)  -12  33452.3  (14.2)  4.61 

2137  3  33446.5  0  33451.2  -4.7  5.16 

2138  3  33485.1  0  33489.9  -4.8  8.82 

2139  3  33559.3  0  33560.9  -1.6  3.44 

2140  3  33650.2  0  33649.8  0.4  1.67 

2141  3  33738.5  0  33734.5  4.0  5.50 

2142  3  33799.0  0  33795.6  3.4  7.38 

2143  3  33825.8  0  33823.1  2.7  2.48 

2144  m  (33798.7)  -12  33813.0  (-14.3)  2.93 

2145  3  33771.6  0  33767.4  4.2  6.62 

2146  3  33698.3  0  33696.5  1.8  5.11 

2147  3  33607.7  0  33614.6  -6.9  9.35 

2148  12  33528.5  0  33530.0  -1.3  3.59 

2149  12  33455.2  0  33451.7  3.5  3.84 

2150  3  33381.2  0  33383.4  -2.2  2.74 

2151  3  33323.5  0  33325.2  -1.7  3.27 

2152  m  (33285.1)  -2  33277.9  (7.2)  2.36 

2153  3  33246.6  0  33243.5  3.1  3.02 

2154  3  33219.5  0  33222.1  -2.6  4.60 

2155  3  33212.5  0  33214.0  -1.5  4.64 

2156  3  33221.0  0  33218.8  2.2  2.63 

2157  3  33273.5  -10  33238.6  34.9  2.54 

2158  3  33267.7  0  33269.7  -2.0  2.51 

2159  3  33313.5  0  33313.1  0.4  3.27 

2160  m  (33374.2)  -2  33369.4  (4.3)  1.94 

2161  3  33434.8  0          -  -  - 

2162  3  33510.5  0          -  - 

2163  3  33592.5  0          -  - 
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FIGURE  1 
BASIC  SCREEN  FOR  SMOOTHING  DATA 

LPRINT  "SAMPLE  NWS2AX1" 

LPRINT  "OUTLIERS" 

_IST  110 

110  C<0)=2160 

I) 

_IST  120 

120  DATA  -10,  0,  0,  -2,  0,  0,  0 

:> 

_IST  170 

170  DATA  33238.6,  33267.7,  33313.5,  33374.2,  33434. S,  33510.5, 

'j 

RUN 

.PRINT   "MISSING  POINTS" 

.PRINT   "OTHER  POINTS" 


TREATMENT  OF  SAMPLE  NWS2A1X  USING  LS7T3.BAS 


OUTLIERS 
2136 
2144 
2157 

MISSING  POIN 
2120 
2128 
2152 
2 1 60 


12 

33466.5 

6 

4 

12 

33789.7 

6 

4 

10 

33273. 5 

5 

4 

S 

33634. 1 

6 

1 

2 

33810. 0 

6 

3 

33285. 1 

6 

"5 

'' 

33374.2 

5 

3 

33812.96 
33238.63 


33634.88 
33816.34 
33277.93 
33369.4 


14.22266 
14.25781 
34.875 


-.7734375 
-5.4375 
7. 175782 
4.804688 


4.610405 
2.9266.79 
2.541 182 


1.475461 
5. 16895 

xl  »  ■-<  k->  6  8  6  •.> 
1.941253 


OTHER  POINTS 


21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 

jU.  1 

21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
21 
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24 
25 
26 
27 
29 
30 
31 


34 
35 
37 
38 
39 
40 
41 
42 
43 
45 
46 
47 
48 
49 
50 
51 

54 
55 
56 
58 
59 


0 
0 
0 
0 

0 
0 
0 
0 
0 
0 
0 
0 
0 

o 

0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 

o 

0 

o 
o 

0 
0 
0 
0 


33664. 

6 

6    1 

33695 

6    1 

33718. 

5 

6    1 

33745. 

S 

7    1 

33767. 

1 

6    1 

33780. 

7 

6    1 

33798 

6    1 

33823. 

7 

6    1 

33827. 

2 

6    1 

33794. 

rj 

6    1 

33726. 

1 

7    1 

33637. 

7 

6    1 

33556. 

5 

6    1 

33486. 

6 

6    1 

33446. 

5 

6    1 

33485. 

1 

6    1 

33559. 

3 

6    1 

33650. 

';3 

7    1 

33738. 

6    1 

33799 

6    1 

33825. 

8 

6    1 

33771. 

6 

6     1 

53698. 

0' 

6     1 

33607. 

7 

6    1 

«'  O'  J^,Oi 

5 

7    1 

33455. 

2 

6    1 

33381. 

'-i 

X— 

6    1 

3-3-323 . 

5 

6    1 

33246. 

6 

6    1 

33219. 

5 

5    1 

*:;■  "T  '••)  -i  '"j 

5 

5    1 

vr  "h*  *"?  "t  •) 

6    1 

33267. 

7 

5    1 

33313. 

5 

5    1 

33665, 
33693, 
33720. 
33744, 
33764. 
33783, 
33799. 
33828, 
33821. 
33786, 
33724. 
33640, 
33556. 
33489, 
33451 . 
33489, 
33560. 
33649, 
33734. 
33795, 
33823. 


28 

54 

87 

58 

55 

69 

72 

12 

64 

6 

05 

13 

91 

96 

15 

95 
oo 

8 

47 
56 
1.3 


33767. 36 


33696, 
33614, 

"V  -T  KT  D  Q 

33451, 
33383, 
33325 
33243, 

33214, 
33218 
33269, 
33313 


Zi 

61 

77 

71 
39 
17 
54 
07 
04 

a 

67 
05 


-, 

6757813 

1  . 

526676 

1. 

460938 

1  . 

408332 

<-) 

371094 

<■} 

412839 

1 . 

226563 

1. 

636923 

r? 

550781 

yt 

020508 

'"? 

988281 

3 

133061 

i . 

722656 

"T 

467283 

4. 

421875 

3 

467283 

5. 

664063 

8. 

346241 

7. 

605469 

11. 

2  ^599 

050781 

--> 

459811 

433594 

■-> 

61768 

-. 

40625 

5. 

074126 

•_'  ■ 

359375 

3 

445967 

4. 

652344 

5. 

162057 

4. 

792969 

3. 

315254 

1. 

617138 

3 . 

443657 

3984375 

1. 

671507 

4. 

0312575 

5. 

498794 

™ 

445313 

7 . 

332694 

'"> 

671875 

<~y 

47908 

4. 

242188 

6. 

616293 

1. 

800781 

5. 

109362 

6. 

9 1 0 1 57 

9. 

347524 

1. 

269531 

■3 . 

590484 

~\ 

492188 

3 . 

333284 

O 

1875 

•~\ 

737984 

1. 

664063 

"T 

271413 

066406 

3 

016105 

570313 

4. 

597027 

1. 

542969 

4. 
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CONCLUSIONS  AND  RECOMMENDATIONS 

The   BASIC   program  LS7T3.BAS  presented  in  this  report   uses  the  Least- 
Squares  Method  to  select  a  polynomial  of  order  1,  2,  or  3  to  fit  7-point 
segments  of  data.    It  establishes  an  estimate  for  the  actual  value  at  the 
nidpoint  of  the  data   segment  and  a  Figure  of  Merit  (FM)  to  provide  an 
indication  of  the  quality  of  that  estimate. 

A  modified  version  of  this  program  (LS7T4.BAS)  is   available  for  a  more 
detailed  examination  of  the  application  of  the  program  to  any  7-point  data 
segment . 

It  is  recommended  that  this  program  be  used  for  smoothing  NUWES  3-0 
data.    It  is  also  recommended  that  this  program  be  transferred  to  a 
spresidsheet  format  for  use  with  LOTUS-123  with  the  intent  of  using  macros  to 
automate  its  application  to  the  extent  possible. 

Possible  modification  of  this  program  to  change  the  treatment  of 
multiple  questionable  points  in  a  data  segment  needs  further  examination  as 
suggested  in  Section  IV. 
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FLOW  CHART:  LS7T3  BASIC  PROGRAM 
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